Binaural cue coding-Part II: Schemes and applications

نویسندگان

  • Christof Faller
  • Frank Baumgarte
چکیده

Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and side information. The companion paper (Part I) covers the psychoacoustic fundamentals of this method and outlines principles for the design of BCC schemes. The BCC analysis and synthesis methods of Part I are motivated and presented in the framework of stereophonic audio coding. This paper, Part II, generalizes the basic BCC schemes presented in Part I. It includes BCC for multichannel signals and employs an enhanced set of perceptual spatial cues for BCC synthesis. A scheme for multichannel audio coding is presented. Moreover, a modified scheme is derived that allows flexible rendering of the spatial image at the receiver supporting dynamic control. All aspects of complete BCC encoder and decoder implementations are discussed, such as down-mixing of the input signals, low complexity estimation of the spatial cues, and quantization and coding of the side information. Application examples are given and the performance of the coder implementations are evaluated and discussed based on subjective listening test results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Binaural cue coding-Part I: psychoacoustic fundamentals and design principles

Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and BCC side information. The BCC side information has a low data rate and it is derived from the multichannel encoder input signal. A natural application of BCC is multichannel audio data rate reduction since only a single down-mixed audio channel needs to be transmitted. An alternati...

متن کامل

Coding of Spatial Audio Compatible with Different Playback Formats

Recently, various schemes were proposed for parametric coding of stereo and multi-channel audio signals. Binaural Cue Coding (BCC) is such a technique. It represents multi-channel audio signals as a single downmixed channel plus a small amount of side information. BCC can be applied to mono and stereo backwards compatible coding of multi-channel audio signals. In this paper, we propose a genera...

متن کامل

Parametric Coding of Spatial Audio

Recently, there has been a renewed interest in techniques for coding of stereo and multi-channel audio signals. Stereo and multichannel audio signals evoke an auditory spatial image in a listener. Thus, in addition to pure redundancy reduction, a receiver model which considers properties of spatial hearing may be used for reducing the bitrate. This has been done in previous techniques by consid...

متن کامل

Härmä and Faller Spatial Decomposition

Techniques where a stereo or a multichannel signal is decomposed into spatial source-labeled time-frequency slots by level, time-difference, and coherence metrics have become popular in recent years. Good examples are binaural cue coding and up/downmixing techniques. In the article, we will provide an overview and discuss parallel approaches in the field of array processing and blind source sep...

متن کامل

Robustness analysis for multi-channel hearing aid algorithms with binaural output by means of objective perceptual quality measures

Introduction According to the ITU-T P.835 recommendation, subjective quality evaluation of noise reduction schemes involves (i) the perceived quality of the speech signal, (ii) the quality of the background signal and (iii) the overall quality. In [7] it has been shown that these subjective measures are predictable by objective measures in the case of monaural noise reduction schemes. In this s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Speech and Audio Processing

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2003